Efficient techniques for genotype‐phenotype correlational analysis

نویسندگان

Subrata Saha

Sanguthevar Rajasekaran

Jinbo Bi

Sudipta Pathak

چکیده

BACKGROUND Single Nucleotide Polymorphisms (SNPs) are sequence variations found in individuals at some specific points in the genomic sequence. As SNPs are highly conserved throughout evolution and within a population, the map of SNPs serves as an excellent genotypic marker. Conventional SNPs analysis mechanisms suffer from large run times, inefficient memory usage, and frequent overestimation. In this paper, we propose efficient, scalable, and reliable algorithms to select a small subset of SNPs from a large set of SNPs which can together be employed to perform phenotypic classification. METHODS Our algorithms exploit the techniques of gene selection and random projections to identify a meaningful subset of SNPs. To the best of our knowledge, these techniques have not been employed before in the context of genotype-phenotype correlations. Random projections are used to project the input data into a lower dimensional space (closely preserving distances). Gene selection is then applied on the projected data to identify a subset of the most relevant SNPs. RESULTS We have compared the performance of our algorithms with one of the currently known best algorithms called Multifactor Dimensionality Reduction (MDR), and Principal Component Analysis (PCA) technique. Experimental results demonstrate that our algorithms are superior in terms of accuracy as well as run time. CONCLUSIONS In our proposed techniques, random projection is used to map data from a high dimensional space to a lower dimensional space, and thus overcomes the curse of dimensionality problem. From this space of reduced dimension, we select the best subset of attributes. It is a unique mechanism in the domain of SNPs analysis, and to the best of our knowledge it is not employed before. As revealed by our experimental results, our proposed techniques offer the potential of high accuracies while keeping the run times low.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

kidd blood group genotyping in alloimmunized thallasemia patients

Abstract Background and Objectives Hemagglutination has limitations in identifying the phenotype of patients who have been recently transfused due to the presence of donor red cells (RBCs) in the patient’s circulation. Kidd blood group is one of the most important blood groups in transfusion medicine and related antibodies are responsible for one third of delayed haemolytic transfusion reactio...

متن کامل

Chromosome Duplication (14q) and The Genotype Phenotype Correlation

متن کامل

Using a GA to Determine Genotype and Phenotype Relationships

A method using genetic algorithms is being proposed to help advance the analysis of genotype and phenotype relationships so as to help in the determination of which genes contribute to a particular disease or condition. The genetic algorithm has potential in mapping out the genotype-phenotype relationship in a computationally efficient way. The GAGENES package (Meli, 2006) was used for this pur...

متن کامل

Cytogenetic genotype-phenotype studies: improving genotyping, phenotyping and data storage.

High-resolution molecular cytogenetic techniques such as genomic array CGH and MLPA detect submicroscopic chromosome aberrations in patients with unexplained mental retardation. These techniques rapidly change the practice of cytogenetic testing. Additionally, these techniques may improve genotype-phenotype studies of patients with microscopically visible chromosome aberrations, such as Wolf-Hi...

متن کامل

Molecular Diagnosis of Familial Hypercholesterolemia

Abstract Background and objectives: Familial hypercholesterolemia (FH) is an autosomal disorder characterized by increased levels of total cholesterol and low density lipoprotein cholesterol. The FH clinical phenotype has been associated with increased risk of coronary heart disease and premature death. The mutation in LDLR gene in most cases is responsible for FH phenotype. Furthermore, other ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 13 شماره

صفحات -

تاریخ انتشار 2013

Efficient techniques for genotype‐phenotype correlational analysis

نویسندگان

چکیده

منابع مشابه

kidd blood group genotyping in alloimmunized thallasemia patients

Chromosome Duplication (14q) and The Genotype Phenotype Correlation

Using a GA to Determine Genotype and Phenotype Relationships

Cytogenetic genotype-phenotype studies: improving genotyping, phenotyping and data storage.

Molecular Diagnosis of Familial Hypercholesterolemia

عنوان ژورنال:

اشتراک گذاری